# Spatio-Temporal Feature Extraction
Vivit B 16x2 Kinetics400 UCF Crime Finetuned AbnormalVideosOnly
MIT
This model is a video classification model based on the ViViT architecture, specifically fine-tuned for anomaly video detection tasks
Video Processing
Transformers

V
Prabesh06
15
0
Videomae Base Finetuned Ssv2
VideoMAE is a video self-supervised pretraining model based on Masked Autoencoder (MAE), fine-tuned on the Something-Something-v2 dataset for video classification tasks.
Video Processing
Transformers

V
MCG-NJU
951
6
Featured Recommended AI Models